Approximate counting of inversions in a data

نویسندگان

  • T. S. Jayram
  • Ravi Kumar
  • D. Sivakumar
چکیده

Inversions are used as a fundamental quantity to measure the sortedness of data, to evaluate diierent ranking methods for databases, and in the context of rank aggregation. Considering the volume of the data sets in these applications, the data stream model 16, 2] is a natural setting to design eecient algorithms. We obtain a suite of space-eecient streaming algorithms for approximating the number of inversions in a permutation to within a factor of. The best space bound we achieve for this problem is O(log n log log n) through a deterministic algorithm. In contrast, we derive an (n) lower bound for randomized exact computation for this problem; thus approximation is essential. For the more general problem of approximating the number of inversions between two permutations , we obtain a randomized O(p n log n)-space algorithm. For approximating the number of inversions in a general list, we give a randomized O(p n log 2 n)-space two-pass algorithm. In contrast, we derive (n) lower bounds for deterministic approximate computation for these problems; thus randomization is essential. All our algorithms use only O(log n) time per data item. Our result for approximating the number of inversions in a permutation is unique and surprising in the following aspect: all of the existing streaming algorithms require randomization in a crucial way, whereas our algorithms are deterministic!

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New Results for Adaptive and Approximate Counting of Inversions

Counting inversions is a classic and important problem in databases. The number of inversions, K∗, in a list L = (L(1), L(2), . . . , L(n)) is defined as the number of pairs i < j with L(i) > L(j). In this paper, new results for this problem are presented: 1. In the I/O-model, an adaptive algorithm is presented for calculating K∗. The algorithm performs O( B + N B logM/B( K∗ NB )) I/Os. When K∗...

متن کامل

Study of Temperature and Salinity Inversions in the Oman Sea

In this research, observational data are analyzed to explain temperature and salinity inversions and double diffusion in the Oman Sea. The results of this paper show that intrusive structures extend over a depth of about 100-450 meters in the Oman Sea. Temperature and salinity inversions observed depth of 250 m in winter and summer in the Oman Sea. Steps thickness is about 10 to 20 meters. A wa...

متن کامل

Counting Inversions Adaptively

We give a simple and efficient algorithm for adaptively counting inversions in a sequence of n integers. Our algorithm runs in O(n+

متن کامل

Study of Temperature and Salinity Inversions in the Oman Sea

In this research, observational data are analyzed to explain temperature and salinity inversions and double diffusion in the Oman Sea. The results of this paper show that intrusive structures extend over a depth of about 100-450 meters in the Oman Sea. Temperature and salinity inversions observed depth of 250 m in winter and summer in the Oman Sea. Steps thickness is about 10 to 20 meters. A wa...

متن کامل

Periodic Oscillations in the Analysis of Algorithms and Their Cancellations

A large number of results in analysis of algorithms contain fluctuations. A typical result might read “The expected number of . . . for large n behaves like log2 n + constant + delta(log2 n), where delta(x) is a periodic function of period one and mean zero.” Examples include various trie parameters, approximate counting, probabilistic counting, radix exchange sort, leader election, skip lists,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002